Cryptotomography: reconstructing 3D Fourier intensities from randomly oriented single-shot 

diffraction patterns 
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We reconstructed the 3D Fourier intensity distribution of mono-disperse prolate nano-particles using single- 
shot 2D coherent diffraction patterns collected at DESY's FLASH facility when a bright, coherent, ultrafast 
X-ray pulse intercepted individual particles of random, unmeasured orientations. This first experimental demon- 
stration of cryptotomography extended the Expansion-Maximization-Compression (EMC) framework to accom- 
modate unmeasured fluctuations in photon fluence and loss of data due to saturation or background scatter. This 
work is an important step towards realizing single-shot diffraction imaging of single biomolecules. 

PACS numbers; 07.05.Kf, 07.05.Pj, 41.60.Cr, 42.30.Rx, 42.30. Wb, 42.55.Vc, 41.50. +h, 61.05.cf, 78.70.Ck 



Single-shot coherent diffraction imaging (CDI) is a devel- 
oping technique aimed at determining the structure of very 
small biomolecules, such as proteins and viruses, not easily 
imaged through more established methods. In one scheme of 
single-shot CDI a serial stream of particles is injected into a 
pulse train of highly coherent and energetic X-ray free elec- 
tron laser (FEL) radiation Jl]]. Photons from a single FEL 
pulse are diffracted when they encounter a particle. Although 
the FEL pulse destroys this particle, its structure is recorded 
in the diffraction data if the pulse is sufficiently short Si • 

As it is difficult to manipulate or determine the orientations 
of very small particles, they are currently injected into the FEL 
radiation at random, unmeasured orientations. Nevertheless, 
sufficiently many 2D diffraction patterns from an ensemble 
of identical albeit randomly oriented particles can in principle 
overdetermine the particle's band-limited 3D Fourier intensi- 
ties. Earlier numerical simulations also show that the parti- 
cle's 3D intensities can be recovered even if the random 2D 
diffraction patterns are remarkably noisy 10-01 • 

This Letter demonstrates the experimental feasibility of 
single-shot CDI with noisy 2D diffraction patterns from 
randomly-oriented identical particles, which we coin crypto- 
tomography. In this first exercise in cryptotomography, we 
reconstruct the 3D intensities of a relatively large and simple 
iron oxide nano-particle. These nearly mono-disperse parti- 
cles are approximately solids of revolution with principal radii 
25 nm and 100 nm (SEM measurements shown in Fig. [T]i. 

Our primary objective is not to study these simple nano- 
particles in any greater detail than what is already available via 
SEM, but to show that cryptotomography, despite its random 



and noisy data, is experimentally viable through the union of 
experimental and theoretical innovations. We do so in the 
style of previous papers in diffraction imaging, which demon- 
strated novel techniques with simple test subjects fllrtl- 

We performed our experiment at DESY's FLASH facility 
with each FEL pulse train comprising 100 pulses (7 nm radia- 
tion; 30 ± 10/im beam focus, extrapolated from 1 81) separated 
by 10 /is, repeated at 5 pulse trains per second. Each detec- 
tor exposure was 1 second long. Additional details on ex- 
perimental parameters and data collection were similar to fill 
and IStllOll respectively, except we used a nebulizer instead to 
aerosolize the nano-particles, which were then directed into 
the FEL radiation using an aerodynamic lens stack 1 11 ] . 

Our experiment imposed two considerable challenges. 
First, the FEL pulse fluence fluctuated due to electron bunch 
dynamics in the lasing process |121. Second, there was con- 
siderable diffuse background scattering (Fig. [T]i from a silicon 
aperture used to shield downstream instruments from beam- 
line scatter. All 5x100 pulses in a single detector exposure 
contributed to this scatter whereas only 1 incident pulse cre- 
ates a useful diffraction pattern if it illuminates a particle. 
This diffuse background changed gradually as the FEL pulses 
eroded the edges of the aperture, exacerbating this issue. 

To overcome the second challenge, we assumed that the 
background scatter added incoherently to the diffraction pat- 
tern from a pulse illuminating a particle. From 2000 diffrac- 
tion patterns (data) we identified those that contained coherent 
scattering (hits), by checking for intensity lobes expected of 
single prolate nano-particles (Fig. [TJ while excluding anoma- 
lous data with scattering from injected debris or those with 
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FIG. 1: (Color online) Top panel shows an SEM image of the pro- 
late iron oxide nano-particles (oblate 3D Fourier intensities) used 
in our experiment. Lower panels show a typical noisy and random 
diffraction pattern from such a nano-particle before (512 x 512 array, 
left) and after processing (91 x 91 array, right): truncated high spa- 
tial frequencies (discarded photons counts outside left white circle); 
binned photon counts; subtracted background; excised non-signal 
pixels (redacted). The logarithm of photon counts are colored ac- 
cording to the inset color bar (max. at pink); negative counts in red. 
The mean dynamic range of photon counts in these post-processed 
data spans 2 orders of magnitude, subject to fluctuations in fluence. 



diffraction patterns from more than one particle. Only non- 
hits without anomalies were considered background data. 

Since the character of the background changed slowly over 
many data-acquisition cycles, for each hit we compared the 
results after separately subtracting five background data ac- 
quired nearest in time. From these five subtractions we se- 
lected the one that gave roughly equal numbers of positive and 
negative photon counts at higher spatial frequencies, where 
the particle's signal is presumably negligible 1 14]. 

Row defects, scattering from the aperture in the multi-layer 
planar mirror |10] and missing photon counts at the lowest 
spatial frequencies were always limited to certain pixels of 
the X-ray sensitive detector These pixels were excised from 
all data — their measurements did not constrain intensity re- 
construction. 2880 (Mpix) usable pixels remained, identical in 
all hits (non-redacted pixels in Fig. [T] lower right). 

Each of the resultant 54 (A/data) background-subtracted hits, 
despite missing information about their orientation (il) and 
fluence (0), are noisy Ewald sphere sections of the particle's 
true 3D intensities. Our goal: to recover the set of 3D intensi- 
ties and fluences most statistically compatible with these hits. 

Our algorithm for recovering the particle's 3D intensities 
10, Eh is based on expectation maximization |13], where we 
iteratively apply a simple rule to increase the compatibility of 
any model intensities {W, even random ones) with all hits. 
Consider the simplest case where we are given only one hit. 
We use a statistical test to determine which Ewald sphere sec- 
tions in W, here on known as tomograms, are most compati- 
ble with this hit, then replace those tomograms with said hit. 



weighted by probabilities taken from the statistical test. This 
prescription on W, which is iterated to a fixed point, deter- 
mines the most likely orientations of this hit with respect to 
an increasingly compatible W. We can generalize this pre- 
scription to many hits by updating the tomograms in consen- 
sus with all hits. This Letter extends our algorithm to also 
determine the fluence distribution of the hits. 

The model 3D intensity W has various representations. It 
can be compactly written as iy(q): the time-integrated scat- 
tered intensity at spatial frequency q when the particle is in 
some reference orientation. We represented iy(q) in our re- 
constructions as a cubic array of floating-point numbers, in- 
dexed by equally spaced samples of q. Detector pixels, la- 
beled by index i, measure Mpix point samples W{qi). 

If we gave the particle some arbitrary orientation from 
the reference position, the intensity recorded at detector pixel 
i is Ty(Rn • qi), where Rq is the orthogonal matrix of this 
rotation. We approximated the continuous il with a discrete 
sampling of M^ot points labeled by the index j (sampling dis- 
cussed in (E]). As a shorthand, we define the intensity at de- 
tector pixel i, after dividing away the incident pulse fluence, 
from diffraction off a particle at approximate orientation j as 
tomogram samples Wij — W{Ilj ■ qi). 

The statistical test central to our expectation-maximization 
algorithm assess the likelihood that measurements on the ith 
detector pixel of the fcth hit {Kik), with photon fluence 
corresponds to tomogram samples of our intensity model Wij . 
Intensity fluctuations at each detector pixel of a particular hit 
due to background-subtraction are assumed to be mutually in- 
dependent. Assuming the noise from background-subtraction 
dominates over Poisson statistics, the likelihood that the /cth 
hit comes from the jth tomogram of W is 

Rjk{W, n, (j)) oc exp I — I . (1) 

The global noise parameter a in eq. (|T|) is the only uncon- 
strained parameter in our algorithm. We seek an ideal a that 
quantifies the true noise in the diffraction data. 

Determining the most likely model parameters (W, il, </>) 
given all hits is unattainable in a single step, hence we adopt 
an iterative procedure that we call Expansion-Maximization- 
Compression (EMC) to implement the model updates • 

In the EMC prescription we first expand (E-step) VF(q) 
into the set of Afrot tomograms Wij. Although this expan- 
sion allows efficient comparisons between hits and model to- 
mograms, it creates redundancy since each intensity sample 
of the 3D model W^(q) is represented in multiple tomograms 
Wij . Such redundancy ameUorates the effects of pixel exci- 
sions when populating M^(q) with data measurements. 

After expanding our model W into tomograms, we max- 
imize (M-step) the data likelihood of each model tomogram 
Wij — > Wij independently. Specifically, we determine the 
new model (W/^ , fij,, 0'^,) which maximizes Eq. condi- 
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tional on probabilities of the current model {Wij , flfc 



argmax N N R,k{W, ft, </)) w, log {Rjk{W', n\ (f>')) . 

(2) 

The numbers wj are weights applied to our rotation group 
samples that approximate a uniform prior distribution [5] . De- 
spite this uniform prior distribution over the rotation group, 
we can still detect orientational biases by evaluating Eq. ^ 
for all hits over the converged W reconstructions. 

The requirement that W^j from different orientations are 
consistent with a single intensity model W^'(q) is enforced in 
the final compression (C-step) by averaging interpolated in- 
tensity samples in all tomograms (details in ||5j]), which repre- 
sent a particular intensity sample in W'{q). We also impose 
Friedel symmetry on VF'(q) since we are operating in the limit 
of weak elastic scattering. This compressed and symmetrized 
model W' (q) is now ready for another round of EMC. 

We exploited a side-effect of EMC's redundant intensity 
representation to find the ideal noise parameter iTmin- If c is 
too small, even though the data likelihood of each updated to- 
mogram is provisionally increased, such tomograms are mutu- 
ally incompatible, thus diminishing the data likelihood of the 
compressed updated intensities. We determined cr,nin know- 
ingly: if (T < (Tmin, the log-likclihood of reconstructions, Eq. 
(|2]l, decline and reconstructions vary dramatically. 

EMC updates of the fluence and intensity model have a re- 
grettable degeneracy: if scaling W'{q) by a multiplicative 
constant increases the model's log-likelihood in (|2|l so will 
a commensurate scaling in cp'. As a consequence, simultane- 
ous EMC updates for 0fe and Wij cannot be decoupled easily. 
However, if we updated only ipk or Wij, while keeping the 
other fixed, the net log-likelihood still increases: 



W' 



J2k ^jkWj Kik/4>k 
J2k RjkWj 



(3) 
(4) 




FIG. 2: (Color online) Comparing reconstructed intensities to those 
of an ideal ellipsoidal particle /ciiip. (a) Cutaway view of choice 3D 
iso-intensity surfaces of a reconstruction which show an oblate inten- 
sity distribution and (b) those of Jciiip with the best R-factor fit to this 
reconstruction; (middle row) mutually perpendicular cross-sections 
of this reconstruction; (bottom row) same cross-sections of /eiiip in 
(b). Logarithm of intensities are shown as hues (color bar in Fig. [TJ. 
Intensities in reconstructions span 3 orders of magnitude. 
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FIG. 3: Probability distribution of reconstructed relative fluence of 
the hits (expected distributions studied in 1 12]), a result of fluctua- 
tions in pulse fluence and the positions of particles when illuminated. 



The likelihood Rjk in the last two equations is evaluated 
at the current model parameters {W, il, (p). We imposed 
RjkWj = 1 during each EMC iteration to assert that every 
hit (index k) must be found at some orientation (index j). 

We reconstructed 3D Fourier intensities from random starts 
using EMC with only diffraction data while imposing Friedel 
symmetry since these are the minimal constraints expected 
in future cryptotomography experiments. We later evaluated 
each converged reconstruction (Figs. ^ by fitting them to in- 
tensity distributions of ideal ellipsoidal particles /eiiip. 

We began each reconstruction with random intensities 
iy(q), represented by random numbers on a cubic array. In- 
tensities were reconstructed only in the range q,j^in < |q| < 
(Zmax (non-redacted regions in 2D sections of Fig. |2]) with 
(Zmin/^max = 20/44 — determined from low spatial frequen- 



cies missing-signal region and maximum scattering angle of 
5.23° in processed hits (white circle in Fig. [T] lower left). 

We normalized each random initial M^(q) to have tomo- 
grams that matched the mean power received per hit; initial 
fluences were set to J^i Kik\ we used a Aii-ot — 3240 to- 
mographic representation of our model (Afrot sufficiency dis- 
cussed in [5]). The noise parameter a was measured in units 
of A A-: the square root of the sum of variances in measured in- 
tensities of each pixel |15]. We determined fTmin = 0.07 Aj^, 
where reconstructions below this showed diminished likeli- 



hood and significant diversity ||16|] . 

We could in principle recover the nano-particle's real-space 
contrast from the reconstructed 3D intensities W via phase re- 
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FIG. 4: (Color online) We superimpose the most likely orientations 
of the nano-particle symmetry axis in the 54 hits found in each of 
10 reconstructions in this azimuthal projection (marked by equidis- 
tant rings of constant latitude in this top hemisphere). Orientations 
are inversion-symmetric to those in the complementary bottom hemi- 
sphere. Many orientations coincide. The vertical dashed line shows 
the detector plane; PEL pulses traveled with the red arrow; nano- 
particles were injected into the page (blue crosshair). 



trieval |7], but there will be pixellation effects (38.3 nm half- 
resolution in processed hits). Instead we determined the prin- 
cipal radii of the nano-particle from our reconstructed inten- 
sities using R-factor comparisons jnll with those of an ideal 
ellipsoidal particle /eiiip (q). The particle's principal radii were 
found to be 30.5 ± 0.8 : 30.6 ± 0.7 : 76.1 ± 1.1 nm with R- 
factor 0.093±0.006 (0. 162±0.003 when reconstructions were 
compared to the intensity function expected of the SEM parti- 
cle measurements) [18 1. We expect a larger error in the longer 
direction of the prolate nano-particle since it corresponds to 
the compact direction of its oblate intensities, which was more 
susceptible to background noise. Fig. [3] shows the concomi- 
tant fluence distribution we reconstructed. 

Fig. |4] shows the most likely orientations of the particles 
coiTesponding to the 54 hits used in intensity reconstruction 
11191]. Reconstructed orientational bias in the data could arise 
from either systematic effects in particle delivery or biases 
during hit-selection. If a nano-particle's axis of symmetry is 
coUnear with the incident direction of PEL pulses its diffrac- 
tion pattern will not have identifiable lobes (unlike Fig. [T]i. Al- 
ternatively, data may contain intensity lobes but are obscured 
by the redacted pixels or background noise. Such data might 
have been missed during hit-selection. 

Despite the simplicity of our nano-particle, we emphasize 
that reconstructing its 3D intensities, using only diffraction 
data, is non-trivial primarily because of ambiguities from 
unmeasured data orientation and fluence. These ambigui- 
ties make direct interpretation of the data very challenging. 
EMC circumvents these ambiguities without prior assump- 
tions about the intensity distribution beyond Friedel symme- 
try, enforcing only simple statistics, Eq. ((T), to determine that 
the particle was prolate instead of oblate. Subsequent R-factor 
fits of converged EMC reconstructions also gave reasonable 



particle dimensions, corroborating the effectiveness of EMC 
on experimental cryptotomography data. 
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